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MULTI -CHANNEL FREQUENCY -DOMAIN ADAPTIVE 
FILTER METHOD AND APPARATUS 

Field of the Invention 

The present invention pertains to adaptive filtering in 
multi-channel environments. More particularly, the invention 
has specific application to multi-channel acoustic echo 
cancellation such as in stereophonic and other multi-channel 
teleconferencing systems. 

Background of the Invention 

The evolution of teleconferencing to a more lifelike and 
transparent audio/video medium depends upon, among other 
things, the evolution of teleconferencing audio capabilities. 
The more realistic the sound, the more lifelike a 
teleconference will be and the more persons and businesses 
will use it. Some present-day teleconferencing systems have 
already evolved to the point of including high-fidelity audio 
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systems (100-7000 Hz bandwidth) . These systems provide a 
significant improvement over older telephone systems (200-3200 
Hz bandwidth) . However, such high fidelity systems are by no 
means the limits of audio evolution in teleconferencing. 

Spatial realism is highly desirable for audio/video 
teleconferencing. This is because of the need of a listener 
to follow, for example, a discussion among a panel of dynamic, 
multiple, and possibly simultaneous talkers. The need for 
spatial realism leads to consideration of multi-channel audio 
systems in teleconferencing, which, at a minimum, involves two 
channels (i.e., stereophonic). 

Many present-day teleconferencing systems have a single 

(monophonic) full-duplex audio channel for voice 
communication. These systems, which range from simple 
speaker-phones to modern video teleconferencing systems, 
typically employ acoustic echo cancellers (AECs) to remove 
undesired echos that result from acoustic coupling. This 
coupling results when sound emitted from the teleconference 
loudspeaker (in response to a signal from a remote location), 
arrives at the teleconference microphone in the same room 

(i.e., the echo). The microphone generates a signal in 
response to this sound that is returned to the remote room in 
which it was originally generated. An AEC employs an adaptive 
filter to estimate the impulse response from the loudspeaker 
to the microphone in a room in which an echo occurs and to 
generate a signal to be subtracted from the receiver signal to 
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cancel that echo electrically* Like monophonic 

teleconf erencing, high-quality stereophonic teleconferencing 

requires AEC. 

Stereophonic AEC presents a problem which does not exist 
in the monophonic context. In monophonic teleconferencing 
systems, a single adaptive filter is used to estimate a single 
impulse response from the loudspeaker to the microphone in the 
room experiencing an echo* There is only one impulse response 
to estimate because there is only one loudspeaker and one 
microphone in the room. As the adaptive filter impulse 
response estimate approaches the true impulse response of the 
room, the difference between these responses approaches zero. 
Once their difference is very small, the effects of echo are 
reduced. The ability to reduce echo is independent of the 
signal from the loudspeaker, since the real and estimated 
impulse responses are equal (or nearly so) and both the room 
(with its real impulse response) and the adaptive filter (with 
its estimated impulse response) are excited by the same 
signal . 

In multi-channel stereophonic teleconferencing systems, 
multiple (e.g., two) adaptive filters are used to estimate the 
multiple (e.g., two) impulse responses of the room. Each 
adaptive filter is associated with a distinct acoustic path 
from a loudspeaker to a microphone in the receiving room. 
Rather than being able to independently estimate the 
individual impulse responses of the room, conventional 
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stereophonic AEC systems derive impulse responses which have a 
combined effect of reducing echo. This limitation on 
independent response derivation is due to the fact that the 
AEC system can measure only a single signal per microphone. 
This signal is the sum of multiple acoustic signals arriving 
at a single microphone through multiple acoustic paths. Thus, 
the AEC cannot observe the individual impulse responses of the 
room. The problem with deriving impulse response estimates 
based on the combined effect of reduced echo is that such 
combined effect does not necessarily mean that the actual 
individual impulse responses are accurately estimated. When 
individual impulse responses are not accurately estimated, the 
ability of the AEC system to be robust to changes in the 
acoustic characteristics of the remote location is limited, 
commonly resulting in undesirable lapses in performance. 

Figure 1 presents a schematic diagram of a conventional 
stereophonic (two-channel) AEC system in the context of stereo 
teleconferencing between two locations. A transmission room 1 
is depicted on the right of the figure. Transmission room 1 
includes two microphones 2, 3 which are used to pick up 
signals from an acoustic source 4 (e.g. , a speaking person) 
via two acoustic paths that are characterized by the impulse 
responses gi(t) and g 2 (t). (For clarity of presentation, all 
acoustic paths are assumed to include the corresponding 
loudspeaker and/or microphone responses.) Output from 
microphones 2, 3 are stereophonic channel source signals x 2 {t) 
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and x 1 {t) , respectively. These stereophonic channel source 
signals, x 2 (t) and x 1 (t) , are then transmitted via a 
telecommunications network (such as a telephone or an ATM 
network) to loudspeakers 11, 12 in a receiving room 10 (shown 
on the left) . For convenience, this direction will herein be 
termed the upstream direction and transmissions in the 
opposite direction, i.e., from room 10 to room 1, will be 
termed the downstream direction. The terms upstream and 
downstream are not intended to be limiting and have no 
particular connotation other than to differentiate between two 
directions. Loudspeakers 11, 12 are acoustically coupled to 
microphone 14 in receiving room 10 via the paths indicated 
with impulse responses h a (t) and h 2 (t). These are the paths by 
which acoustic echo signals arrive at microphone 14. 

The output of the microphone 14 is signal y(t), which is 
a signal representing acoustic signals in the receiving room 
impinging on the microphone. These acoustic signals include 
the acoustic echo signals as well as any signals independently 
generated in the room (such as by a speaking person) . 
Loudspeakers 11, 12 are also acoustically coupled to 
microphone 13 by other acoustic paths. For clarity of 
presentation, however, only the coupling to microphone 14 and 
AEC with respect to its output will be discussed. 

Further, those of ordinary skill in the art will 
recognize that the analysis concerning AEC for the output of 
microphone 14 is applicable to the output of microphone 13 as 
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well. Similarly, those skilled in the art will recognize that 
AEC as performed for the outputs of microphones 13 and 14 in 
receiving room 10 also may be advantageously performed for the 
outputs of microphones 2 and 3 in transmitting room 1, wherein 
the functions of receiving room 10 and transmitting room 1 are 
swapped. 

If nothing were done to cancel the acoustic echo signals 
in receiving room 10, these echoes would pass back to 
loudspeaker 5 in transmission room 1 (via microphone 14 and 
the telecommunications network) and would be circulated 
repeatedly, producing undesirable multiple echoes, or even 
worse, howling instability. This, of course, is the reason 
that providing AEC capability is advantageous. 

Conventional AECs typically derive an estimate of the 
echo with use of a finite impulse response (FIR) filter with 
adjustable coefficients. This "adaptable" filter models the 
acoustic impulse response of the echo path in the receiving 
room 10. Figure 1 generally illustrates this technique with 
use of AEC 20 using two adaptive FIR filters 16, 15 having 
impulse responses, Ai(t) and h 2 [t) , respectively, to model the 
two echo paths in the receiving room 10* Filters 16, 15 may 
be located anywhere in the system (i.e.,- at the transmitting 
room 1, in the telecommunications network, or at the receiving 
room 10), but are preferably located at the receiving room 10. 

Driving these filters 16, 15 with the upstream 
loudspeaker signals x t {t) and x 2 (t) produces signals y 2 (t) and 
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y 2 (t), which are components of a total echo estimate. The sum 
of these two echo estimate component signals yields the total 
echo estimate signal, y(t), at the output of summing circuit 
17. This echo estimate signal, y(t), is subtracted from the 
downstream signal y(t) by subtraction circuit 18 to form an 
error signal e(t). Error signal e(t) is intended to be small 
[i.e., driven towards zero) in the absence of near-end speech 
(i.e., speech generated in the receiving room). 

In most conventional AEC applications, the coefficients 
of adaptive filters 15, 16 are derived using well-known 
techniques, such as the familiar IMS (or stochastic gradient) 
algorithm. The coefficients are updated in an effort to 
reduce the error signal to zero. As such, the coefficients 
h x (t) and h 2 (t) are a function of the stereophonic signals 
x 2 (t) and x 1 {t) and the error signal, e(t). 

As mentioned above, unlike monophonic AECs, conventional 
stereophonic AECs do not independently estimate the individual 
impulse responses of a room. Rather, conventional 
stereophonic AEC systems derive impulse responses which have a 
combined effect of reducing echo. Unless individual impulse 
responses are accurately estimated, the ability of the AEC 
system to be robust to changes in the acoustic characteristics 
of the remote location is limited and undesirable lapses in 
performance may occur. 

This problem is discussed fully in U.S. Patent 
Application No. 09/395,834, entitled A Frequency Domain 
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Stereophonic Acoustic Echo Canceller Utilizing Non-Linear 
Transformation, which is incorporated herein by reference. 

Not only must the adaptation algorithm of filters 15, 16 
track variations in the receiving room, it must also track 
variations in the transmission room. The latter variations 
are particularly difficult to track. For instance, if one 
talker stops talking and another starts talking at a different 
location in the room, the impulse responses, g 2 and g 2 , change 
abruptly and by very large amounts. 

J. Benesty, A. Gilloire, Y. Grenier, A frequency domain 
stereophonic acoustic echo canceller exploiting the coherence 
between the channels, Acoustic Research Letters Online, 21 
July 1999, discloses a frequency domain algorithm for use in a 
stereophonic echo canceller that exploits the coherence 
between the channels. 

As can be seen from the above discussion, therefore, the 
challenge is to devise an approach which (as in the case of a 
single-channel echo canceller) converges independently of 
variations in the transmission room. Thus, it is desirable to 
de-correlate x t and x 2 . 

U.S. Patent Application No. 09/395,834 discloses a 
teleconferencing system that de-correlates the channel signals 
in a multi-channel teleconferencing system. Figure 2 is a 
schematic diagram of a stereophonic teleconferencing system 
that includes circuitry for de-correlating x^ and x 2 in 
accordance with the teachings of U.S. Patent No. 5,828,756. 
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The system of Figure 2 is identical to that of Figure 1 
except for the presence of non-linear signal transformation 
modules 25, 30 (NL) , which have been inserted in the paths 
between microphones 3, 2 of transmission room 1 and 
loudspeakers 11, 12 of receiving room 10. By operation of 
non-linear transformation modules 25, 30, stereophonic source 
signals x t {t) and x 2 (t) are transformed to signals x/ (t) and 
x 2 '(t), respectively, where "'" indicates a transformed signal 
which (in this case) advantageously has a reduced correlation 
with the other transformed signal of the stereophonic system. 

As with the system presented in Figure 1, the filters of 
AEC 20 may be located anywhere within the system, but are 
preferably located at receiving room 10. Non-linear 
transformation modules 25, 30 also may be located anywhere (so 
long as receiving room 10 and AEC 20 both receive the 
transformed signals as shown) , but are preferably located at 
transmitting room 1. 

Specifically, in accordance with one embodiment of the 
device disclosed in U.S. Patent No. 5,828,756, the signals 
x 1 {t) and x 2 (t) are advantageously partially de-correlated by 
adding to each a small non-linear function of the 
corresponding signal itself. It is well-known that the 
coherence magnitude between two processes is equal to one (1) 
if and only if they are linearly dependent. Therefore, by 
adding a "noise" component to each signal, the coherence is 
reduced. However, by combining the signal with an additive 
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component which is similar to the original signal, the audible 
degradation may be advantageously minimized, as compared to 
the effect of adding, for example, a random noise component. 
This is particularly true for signals such as speech, where 
the harmonic structure of the signal tends to mask the 
distortion. 

Thus, a linear relation between (t) and x 2 ' (t) is 
avoided, thereby ensuring that the coherence magnitude will be 
smaller than one. Such a transformation reduces the coherence 
and hence the condition number of the covariance matrix, 
thereby improving the misalignment* Of course, the use of 
this transformation is particularly advantageous when its 
influence is inaudible and does not have any deleterious 
effect on stereo perception. For this reason, it is 
preferable that the multiplier, a, be relatively small. 

In one illustrative embodiment of U.S. Patent No. 
5,828,756, the non-linear functions f x and f 2 as applied by 
non-linear function module 32 are each half-wave rectifier 
functions . 

The above-described solution proposed in U.S. Patent No. 
5,828,756 is a simple and efficient solution that overcomes 
the above-discussed problems by adding a small non-linearity 
into each channel. The distortion due to the non-linearity is 
hardly perceptible and does not affect the stereo effect, yet 
reduces inter-channel coherence, thereby allowing reduction of 
misalignment * to a low level. However, because the introduced 
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distortion is so small (so as not to significantly affect 
sound quality) , the echo cancellation algorithm must be very 
powerful in order to converge to a solution within a 
reasonably small period of time when conditions in the room 
change* A least mean squares (LMS) solution does not converge 
fast enough • A much more powerful algorithm is necessary in 
order to make the system illustrated in Figure 2 work. 

The aforementioned U.S. Patent Application No. 09/395,834 
discloses an acoustic echo canceller that exploits the 
coherence between multiple channels in the system illustrated 
in Figure 2. Particularly, it discloses one efficient 
frequency-domain adaptive algorithm used in the echo canceller 
circuits . 

Summary of the Invention 

The invention is a multiple channel adaptive filtering 
method and apparatus that is particularly suitable for use for 
acoustic echo cancellation in a multi-channel teleconferencing 
system, but has much broader application to any type of 
adaptive filtering environment. The method and apparatus 
exploits a new frequency-domain adaptive algorithm by using a 
frequency-domain recursive least squares criterion that 
minimizes the error signal in the frequency-domain. In 
accordance with the method and apparatus, an exact adaptive 
algorithm based on the normal equation is provided. Further, 
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in order to reduce the complexity of the algorithm, a 
constraint is removed resulting in an unconstrained frequency- 
domain least mean squares (UFLMS) algorithm. 

The invention further includes a method and apparatus for 
selecting an optimal step size for the UFLMS . Most 
importantly, the algorithm is generalized to the multiple 
channel case thereby exploiting the cross-power spectra among 
all the channels which allows for a fast convergence rate in 
the multiple channel acoustic echo cancellation environment 
where the input signals are highly correlated. 

Brief Description of the Drawings 

Figure 1 is a schematic diagram of a conventional 
stereophonic teleconferencing system. 

Figure 2 is a schematic diagram of a prior art 
stereophonic teleconferencing system in which small non- 
linearities have been added into the channel paths in order to 
selectively reduce the correlation between the individual 
channel signals. 

Figure 3 is a schematic diagram in accordance with an 
illustrative embodiment of the present invention. 

Figure 4A is a graphical diagram illustrating convergence 
of the mean square error (MSE) of a stereophonic 
teleconferencing system in accordance with a normalized least 
mean-squares (NLMS)echo cancellation algorithm of the prior 
art. 



PATENT 13 Docket No. Benesty 6-9 

Figure 4B is a graphical diagram illustrating 
misalignment in a stereophonic teleconferencing system 
employing echo cancellation in accordance with an NLMS echo 
cancellation algorithm of the prior art. 

Figure 5A is a graphical diagram illustrating convergence 
of the mean square error (MSE) of a stereophonic 
teleconferencing system in accordance with an FLRS echo 
cancellation algorithm of the prior art. 

Figure 5B is a graphical diagram illustrating 
misalignment in a stereophonic teleconferencing system 
employing echo cancellation in accordance with an FRLS echo 
cancellation algorithm of the prior art. 

Figure 6A is a graphical diagram illustrating convergence 
of the mean square error (MSE) of a stereophonic 
teleconferencing system in accordance with a first embodiment 
of the present invention. 

Figure 6B is a graphical diagram illustrating 
misalignment in a stereophonic teleconferencing system 
employing echo cancellation in accordance with the first 
embodiment of the present invention. 

Figure 7A is a graphical diagram illustrating convergence 
of the mean square error (MSE) in a stereophonic 
teleconferencing system in accordance with a second embodiment 
of the present invention. 

Figure 7B is a graphical diagram illustrating 
misalignment in a stereophonic teleconferencing system in 
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accordance with the second embodiment of the present 
invention* 



Detailed Description of the Invention 

Figure 3 is a schematic diagram of a stereophonic 
teleconferencing system in accordance with the present 
invention. It is essentially identical to Figure 2 except for 
the fact that the adaptive filters 15 and 16 have been 
replaced with adaptive filters 302 and 304. As will be 
explained in detail below, adaptive filters 302 and 304 
implement a frequency domain adaptive algorithm that tracks 
variations in the receiving room impulse responses h lf h 2/ 
respectively. 

1 . Introduction 

Since its first introduction by Dentino et al. [1], 
adaptive filtering in the frequency-domain has progressed very 
fast, and different sophisticated algorithms have since been 
proposed. Ferrara [2] was the first to elaborate an efficient 
frequency-domain least mean square adaptive filter algorithm 
(FLMS) that converges to the optimal (Wiener) solution. 
Mansour and Gray [3] derived an even more efficient algorithm, 
the unconstrained FLMS (UFLMS), using only three FFT 
operations, per block instead of five for the FLMS with 
comparable performances [4]. However, a major handicap of 
these solutions is delay. Indeed, this delay is equal to the 
length of the adaptive filter L, which is considerable for 
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some applications like acoustic echo cancellation (AEC) where 
the number of- taps can easily exceed one thousand. 

A new structure, using the classical overlap save (OLS) 
method, was proposed in [5] and [6] and generalized in [7] 
where the block processing N was made independent of the 
filter length L and N can be chosen as small as desired, with 
a delay equal to N. Although from the standpoint of 
complexity, the optimal choice is N = L, using smaller block 
sizes (N < L) in order to reduce the delay is still more 
efficient than the time-domain algorithms. A more general 
scheme based on weighted overlap and add (WOLA) methods, the 
generalized filter (GMDFot) , was proposed in [8], [9], where a 
is the overlap factor. The settings a > 1 appear to be very 
useful in the context of adaptive filtering, since the filter 
coefficients can be adapted more frequently (every N/a samples 
instead of every N samples in the standard OLS scheme) . So 
this structure introduces one more degree of freedom, but the 
complexity is increased by a factor of roughly a. Using a 
block size as large as the delay permits will increase the 
convergence rate of the algorithm, while taking an overlap 
factor greater than 1 will increase the tracking abilities of 
the algorithm. 

A new frequency-domain adaptive algorithm is derived 
using a frequency-domain recursive least squares criterion. A 
similar criterion was proposed in [3] using mathematical 
expectations instead. Here, however, an exact adaptive 
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algorithm is derived from the normal equation. The obtained 
algorithm is complex to implement . To reduce the complexity, 
a constraint is removed that will render exactly the UFLMS 
[3] . A scheme for selecting the optimal adaptation step for 

5 UFLMS is disclosed. Most importantly, we generalize all this 
to the multi-channel case. The obtained algorithm exploits 
the cross-power spectra among all the channels, which is very 
important (for a fast convergence rate) in multi-channel AEC 
where the input signals are highly correlated [10] . To 
1© simplify the presentation, we will derive all of the 

;^ algorithms only for L = N, a = 1, and with the OLS method. 

J^; Generalization to any other case is straightforward. 

In the equations below the following conventions should 

^. be noted: (1) lower case bold face variables denote vectors, 
iff; (2) upper case bold face variables denote matrices, (3) 

% underlined variables denote frequency-domain values, and (4) 

^ variables with hats denote estimates. 

2. Mono-Channel Frequency-Domain Adaptive Filtering 
Re -visited 

20 

In the time-domain, the general procedure to derive an 
adaptive algorithm is to first define an error signal, then to 
build a cost function based on the error signal, and finally 
to minimize the cost function with respect to the adaptive 
25 filter coefficients [11] . In the context of system 

identification, the error signal at time n between the system 
and model filter outputs is given by 

e(n) - y(n) - y (n) Eq. (1) 
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Eq. (2) 

is an estimate of the output signal y(n), 

h= [hO hj ... h L _^ 



is the model filter, and 

x(«) = [x(ri) x(n- 1) ... L+ l)] r 

is a vector containing the last L samples of the input signal 
x. Superscript 1 denotes the transpose of a vector or matrix. 
The recursive least squares (RLS) adaptive algorithm is 
obtained exactly from the normal equation which is derived by 
minimizing the following time-domain criterion [11] : 

n 

J t (n) = (1 - ;t t )£ ^ _P e 2 (p) , Eq. (3) 

p=0 

where X t (0 < X t < 1) is an exponential forgetting factor. In 
the rest of this section, we will follow the same approach. 
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where 



y(ri) = h T x(ri) 
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We now define the block error signal (of length N = L) 

as : 



y(m)= [y(mL) ... y(mL+ L- l)f 
y(m)= [x(mZ) ... x(wL + L-lfh 
= X r (m)h. 



It can easily be checked that X is a Toeplitz matrix of size 
(L x L) . 

It is well known that a Toeplitz matrix X can be 
transformed, by doubling its size, to a circulant matrix- 



e(m) = y(m) - y(m) , 



Eq. (4) 



where m is the block time index, and 




X' 



X 



c = 



X X' 



where X' also is a Toeplitz matrix. (The matrix X 1 can be 
expressed in terms of the elements of X, except for an 
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arbitrary diagonal.) Using circulant matrices, the block 
error signal can be re-written equivalently: 



"o Lxl " 




"o Lxl " 






y(m) 



- Wy'(m) , 



Eq. (5) 



where 



and 



w = 



^LxL ^LxL 



y'(ffi) = C(m) 



'Lxl 



Eq. (6) 



It is also well known that a circulant matrix is easily 
decomposed as follows: 



10 C = F^DF, 

where F is the Fourier matrix of size (2Lx2L) and D is a 
diagonal matrix whose elements are the discrete Fourier 
transform of the first column of C. If we multiply Eq. (5) by 
F, we get the error signal in the frequency domain: 
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where 



^m) = y(m) - Gy\m) 
= y(m) - GD(m)h, 



Eq. 



e(m) = F 



y(m) = P 



G = 
y'(m) 



w Lxl 

e(ro) 



y(m) 
,-i 



FWF 

= Fy'(m) , 



h = F 



'Lxl 



Having derived a frequency-domain error signal, we now 
define a frequency-domain criterion which is similar to Eq. 
(3) : 
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J f (m) = (1 - l t ) 



X *r v 



(P) e (p) , 



Eq. (8) 



p=0 



where H denotes conjugate transpose ♦ Let V be the gradient 



operator (with respect to h ) • Applying the operator V to 



the cost function J f , we obtain (noting that G H G = G 2 = G) the 
complex gradient vector: 
djf (m) 



Vj _<m) = 



dh(m) 



m 



= - (1- X £ )£ 1™ p D(P)G*y*( P ) 



Eq. (9) 



+ (1 - If) 



p=0 
m 



£ X»-PD(p)0*D*(p) 



p = 0 



h (m) , 



where * denotes complex conjugate. By setting the gradient of 
the cost function equal to zero, conjugating, and noting that 

Gy(p) = Y(p) / we obtain the so-called normal equation: 
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S (m) h (m) = s (m) , Eq. (10) 

where 

xa 



S (u) = (1 - X f ) ^ P D*(p) GD(p) 



Eq. (11) 
p = 0 



* . 



= X t S (m - 1) + (1 - X f ) D (m)GD(m) 



and 



s (m) = (1 - If) " P D*(p) y (p) 

p=0 

= % f s (m - 1) + (1 - X t ) D*(m) y (m) 



Eq. (12) 



It can be shown that, if the covariance matrix of the input 
signal is of rank L, then the matrix S (m) is nonsingular [3], 
[4] . In this case, the normal equation has a unique solution 
which is the optimal Wiener solution. 

Enforcing the normal equation at block time indices m and 
in - 1, and using Eq, (11) and Eq. (12), we easily derive an 
exact adaptive algorithm: 
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e (m) = y (m) - GD (m) h (m - 1) 



Eq. (13) 



h(in) = h(m - 1) + (l - X f ) S 1 (m) D*(m) e (m) . 



Eq. (14) 



This will be called the constrained algorithm. Note that this 
definition is different from the original frequency-domain 
adaptive algorithm proposed by Ferrara [2], (The constraint 
here is on the update of the matrix S while, in Ferrara 1 s 
algorithm, the constraint is on the update of the coefficients 
of the filter) . It can be shown that the convergence of the 
proposed algorithm for stationary signals does not depend on 
the statistics of the input signal which is, of course, a very 
desirable feature. 

Frequency-domain adaptive algorithms were first 
introduced to reduce the arithmetic complexity of the LMS 
algorithm [2] . Unfortunately, the matrix S is not diagonal, 
so the proposed algorithm has a high complexity and may not be 
practical to implement. If, however, 2G can be well 
approximated by the identity matrix, we then obtain the 
following unconstrained algorithm. 



S u (m) 



= l f S u (m - 1) + (l - A, f ) D*(m) D (m) 



Eq. (15) 



h(m) = h(m - 1) + |i u s u 1 ( m ) D *(*0 ©(at) 



Eq. (16) 
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where S u is now a diagonal matrix and ]jl u = 2(1-X f ) is a 
positive number* This algorithm is exactly the unconstrained 
frequency-domain adaptive filter proposed by Mansour and Gray 
[3] and since S u is diagonal, this algorithm is very attractive 
from a complexity point of view. Below, it is shown that this 
approximation is justified. Also, below we disclose the 
optimum value for p- u . 

Let us examine the structure of the matrix G. We have: 
G* = F _1 WF; since W is a diagonal matrix, and G is a circulant 
matrix. Therefore, inverse transforming the diagonal of W 
gives the first column of G*, 



g 



* 




T 



= F 1 [o . . .0 1 . . .l] T . 



The elements of vector g can be written explicitly as: 



2L-1 




1=L 



Eq. (17) 




exp 




1=0 
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we have: 
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-1. Since g k is the sum of a geometric progression, 



gk = 



0.5 



2L 



k = 0 

1 - exp(-iTCk) 
1 - exp(-i7ik / L) 



0.5 

0 

-1 
2L 



7tk 

1 - x cot ( ) 

2L 



k=0 
k even 

k odd, 



Eq. (18) 



where L-l elements of vector gr are equal to zero. Moreover, 
since G H G=G, then tf*g=gb=0 . 5 and we have 



2L-1 L-l 



gHg " g o = X |gx|2 = 2 E l gl l 2 = 7 Eq - < 19) 



1=1 1=1 
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We can see from Eq. (19) that the first element of vector g, 
i.e., g 0 , is dominant in a mean-square sense, and from Eq. (18) 
that the first L elements of g decrease rapidly to zero as k 
increases. Because of the conjugate symmetry, some of the 
last elements of g are non-negligible. However, this is of 
little concern since G is circulant with g as its first column 
and its other columns have those non-negligible elements 
shifted in such a way that they are concentrated around the 
main diagonal. 

To summarize, only the first few off-diagonals of G will 
be non-negligible, while the others can be completely 
neglected. Thus, approximating G by a diagonal matrix, i.e., 
G » g 0 I=I/2, is reasonable, and in this case we will have 

U u « (l-X f )/g 0 =2(l-X f ) 

for an optimal convergence rate. Note that this is in 
agreement with previous derivations [2] that give an optimal 
step size of 1-X f divided by a power normalizing factor, which 
for our assumed unit-power signal with L-sample zero padding 
is equal to 1/2. 

3. Generalization to the Multi-Channel Case 

The generalization to the multi-channel case is rather 
straightforward. Therefore, this section only highlights some 
important steps and states the algorithms. For convenience, 
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we will use the same notation as previously employed. Let J 
be the number of channels. Our definition of multi-channel is 
that we have a system with J input signals x u j=l, , 2 . . . , J and 
one output signal y. Now the block error signal is defined 
as: 



J 

e(m) = y(m) - ^ xT(m) hj, Eq . (20 ) 

j = l 



where e and y are vectors of and y^ respectively, all 
matrices X 3 are Toeplitz of size (LxL) , and is the 

estimated impulse response of the jth channel. In the 
frequency-domain, we have [c.f. Eq. (7)]: 

J 

e(m) = y(m) - Dj(m) h. 

j = 1 3 Eq. (21) 

= y(m) - GD (m) h , 
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where D = [D x D 2 ... Dj] is a (2Lx2LJ) matrix containing all the 



J diagonal matrices Da and 



T c_ T 
^2 



T is the (2LJxl) 



vector of concatenated, transformed, zero-padded estimated 
impulse responses. Minimizing the criterion defined in Eq. 
(8), we obtain the normal equation for the multi-channel case: 



S (m) h(m) = S (m) 



Eq. (22) 



m 



S(m) = (1- Xf)^ X? P C H (p)GD(p) 
p = 0 

= l±S (m - 1) + (1 - X f ) D H (m) GD (m) 



Eq. (23) 



is a 2LJx2LJ) matrix and 



m 



S(ltl) = (1 - X f )Y ^ P D H (p) jr(p) 

^ Eq. (24] 



p=0 

= X f s (m - 1) + (1 - X f )D H (m) jr (m) 
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is a (2LJ x 1) vector. Using the same approach and 
definitions as in Section 2, we obtain the multi-channel, 
constrained, frequency-domain, adaptive algorithm: 

£(m) = y{m) - GD(m)h(m - 1) Eq. (25) 



h(m) = h (m - 1) + (1 - X t ) S 1 (m) D H (m) e (m) Eq. (26) 

and the multi-channel unconstrained frequency-domain adaptive 
algorithm: 

S u (m) = Xf S n (m - 1) + (1 - l f )I? H (ia) D (m) Eq. (27) 

h (m) = h (m - 1) + jiuSu 1 (m) D H (m) e (la) • Eq. (28) 

Now, S u is not a diagonal matrix, but a block matrix containing 

J 2 diagonal matrices that are estimates of the power spectra 

and cross-power spectra of all the input signals. 

Particular case: The tiro-channel unconstrained frequency- 
domain adaptive algorithm. 

We easily deduce the algorithm from Eq. (27) and Eq. 

(28) : 

= Y( m ) - G \d x (m) h 1 (m - 1) + D 2 (m) h 2 (m - 1)1 Eq. (29) 
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h.Xm) = h^/w- l)+// M S^(m)[D;(w)- S u (w)Sj(»i)D;(w)]e(w) Eq. (30) 



h 2 (m) = h 2 (m - 1)+ n u <m) [dJ < m > " s 2,l( m ) S l,i( m ) D i (m)] e (m) Eq. (31] 



where S jrl are the (diagonal) sub-matrices of S u , 



Sj (m) = (m) [j 2L x2L " T H (m) T (m) 



, j = 1,2 Eq. (32) 



and 



r(m) = (m) S 2/2 (m) 1/2 S 1/2 (m) 



Eq. (33) 



is the coherence matrix. This algorithm is exactly the same 
as in [13], exploiting the coherence between the two channels 
in order to improve the convergence rate of the adaptive 
filter. 

For simplicity, we have derived the multi-channel case 
assuming N = L and no overlap (a = 1) . It is easy to 
generalize this for ot>l by simply computing the FFTs using 
overlapped data, which is exactly what is done in the next 
section for the application example (oc=4) . Furthermore, it is 
straightforward, although tedious, to generalize to the case 
of N<L [8] . 
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4 . Application to Acoustic Echo Cancellation and Simulations 

Multi-channel acoustic echo cancellation (AEC) is 
typically intended for use in high quality teleconferencing 
systems and in multi participant desktop conferencing, 
implementing sound transmission through at least two channels. 
Multi-channel AEC can be viewed as a simple generalization of 
the single-channel acoustic echo cancellation principle. 
Figures 1, 2 and 3 show this technique, in the two-channel 

(stereo) case, for one microphone in the receiving room, (which 
is- represented by the two echo paths A a and h 2 between the two 
loudspeakers and the microphone) . The two reference signals x 2 
and x 2 from the transmission room are obtained from two 
microphones in the case of teleconferencing. These signals 
are derived by filtering from a common source and this gives 
rise to the non-uniqueness problem discussed in the Background 
section that does not arise for the single-channel AEC. Also, 
as previously noted, the usual adaptive algorithms, therefore, 
converge to solutions that depend on the impulse responses in 
the transmission room. This means that for good echo 
cancellation one must track not only the changes in the 
receiving room, but also the changes in the transmission room 

(for example, when one person stops talking and another person 
starts) . The same problem occurs for multi-channel desktop 
conferencing, where multi-channel sound is synthesized from 
the single-microphone signals of all the participants [12] . 
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Also as discussed in the Background section, U.S. Patent 
No. 5,828,756 proposes a simple but efficient solution that 
overcomes the above-noted problem by adding a small non- 
linearity into each channel (Figures 2 and 3) . The distortion 
5 due to the non-linearity is hardly perceptible for speech yet 
it reduces inter-channel coherence thereby allowing reduction 
of misalignment to a low level. However, this solution is 
fruitful only when combined with the multi-channel FRLS 
algorithm which implies a high level of computational 
1|§ complexity such that a real time implementation is difficult. 
J: Aforementioned U.S. Patent Application No. 09/395,834 
\g discloses an alternate, frequency-domain based, algorithm for 
M generating the acoustic echo cancellation signal based on an 
* extended least mean squares (ELMS) scheme. The present 
lfSj invention is an alternative efficient multi-channel frequency- 
m domain adaptive method and apparatus that is even less 
computationally burdensome. 

We now show, by way of simulation using two channels, 
that the proposed unconstrained frequency-domain adaptive 
20 filter is a good alternative to some classical ' algorithms, 

namely, the two-channel Normalized Least Mean Squares (NLMS) 
and the two-channel Fast Recursive Least Squares (FRLS) . The 
signal source s in the transmission room is a 10 s speech 
signal. The two microphone signals x x and x 2 were obtained by 
25 convolving s with two impulse responses g lf g 2 of length 4096, 
as measured in an actual room. The microphone output signal y 
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in the receiving room is obtained by summing the two 
convolutions (A, * Xj ) and. h 2 * x 2 ) , where h 2 and h 2 also were 
measured in an actual room as 4096-point responses. A white 
noise signal with 45 dB SNR is added to y. The sampling rate 
is 16 kHz. The length of the two adaptive filters is taken as 
L = 1024. In all of the simulations, we added a half-wave 
rectifier non linearity (with gain of 0.5) to the signals x 2 
and x 2 . For the proposed algorithm, we used the following 
parameters: N = 1024, and a =4 (which implies an overall 
delay equal to 1024 samples, i.e., 64 ms) . With these values 
of N and a, the proposed algorithm is 7 times less complex 
than the two-channel NLMS and 40 times less complex than the 
two-channel FRLS . 

Figures 4A, 5A and 6A show the convergence of the Mean 
Square Error (MSE) for the prior art two channel Normalized 
Least Mean Squares (NLMS) solution, the prior art two-channel 
Fast Recursive Least Squares (FRLS) solution and the solution 
of the present invention, respectively. 

Figures 4B, 5B and 6B show the misalignment for the same 
three cases, respectively. 

For the purpose of smoothing the curves, error and 
misalignment samples were averaged over 128 points. For the 
FRLS algorithm, we chose a value = 1 - 1/ (20L) where L = 
1024 in this case. Accordingly, for the block frequency- 
domain algorithm, we chose X f = = 0.95 in order to have 
the same effective window length. 
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It can be seen that the proposed algorithm outperforms 
the other two with respect to misalignment while also being 
much less complex to implement. Also, the steady-state 
attenuation of the MSE for the frequency-domain algorithm 
appears to be as good as that for the FRLS (and somewhat 
better than NLMS) as confirmed by informal listening tests • ' 
However, the initial convergence rate of the proposed 
algorithm is somewhat slower than that of the FRLS. This can 
be improved by using a small exponential forgetting factor as 
shown in Figures 7A and 7B which show the convergence of the 
MSE and the misalignment, respectively, for the present 
invention with A f=s 0.9, but the misalignment is then increased 
somewhat. The optimal tradeoff between convergence rate and 
misalignment is very subjective and application dependent. 

For clarity of explanation, the illustrative embodiments 
of the present invention described herein were presented as 
comprising individual functional blocks. The functions that 
these blocks represent may be provided through the use of 
either shared or dedicated hardware, including, but not 
limited to, hardware capable of executing software. For 
example, the functions of the blocks presented in the various 
illustrative figures may be provided by a single shared 
processor. (Use of the term "processor" should not be 
construed to refer exclusively to hardware capable of 
executing software.) Potential embodiments may comprise 
digital signal processor (DSP) hardware, read-only memory 
(ROM) for storing software performing the operations discussed 
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above, and random access memory (RAM) for storing DSP results. 
Very large scale integration (VLSI) hardware embodiments, as 
well as custom VLSI circuitry in combination with a general 
purpose DSP circuit, may also be provided. 
5 . Conclusions 

We have derived a class of multi-channel frequency-domain 
adaptive algorithms from a frequency-domain recursive least 
squares criterion. The constrained algorithm was deduced 
directly and exactly from the normal equation and, in this 
sense, is optimal, while the unconstrained version is a good 
approximation. Most importantly, both algorithms exploit the 
cross-power spectra (or equivalently the cross-correlations in 
the time-domain) among all the channels and this feature is 
fundamental for the algorithms to converge rapidly to the 
Wiener solution, especially for applications like multi- 
channel AEC where the channels are highly correlated. 

While specific embodiments of the invention have been 
described in connection with acoustic echo cancellation in a 
multiple channel teleconferencing system, it should be 
understood that this is merely an exemplary application and 
that the invention has much broader applicability. The 
invention, for instance, can be used in connection with 
virtually any multiple channel adaptive filtering application, 
such as multi-channel equalizers. 

Having thus described a few particular embodiments of the 
invention, various alterations, modifications, and 
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improvements will readily occur to those skilled in the art. 

Such alterations, modifications and improvements as are made 
obvious by this disclosure are intended to be part of this 
description though not expressly stated herein, and are 
intended to be within the spirit and scope of the invention. 
Accordingly, the foregoing description is by way of example 
only, and not limiting. The invention is limited only as 
defined in the following claims and equivalents thereto. 
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CLAIMS 



1* A method of adaptively filtering a signal transmitted 
over a channel, said signal containing an input signal and 
multiple impulse responses, said multiple impulse responses to 
be adaptively filtered, said method comprising the steps of: 

generating an estimate of an impulse response 
corresponding to each of said multiple impulse responses; 

generating a sum of said estimates; and 

generating an error signal representing the difference 
between said signal and said sum of said estimates; 

wherein said estimates are generated using a frequency 
domain recursive least squares algorithm. 

2. The method set forth in claim 1 wherein each of said 
estimates is generated by diagonally decomposing by Fourier 
transformation a circulant matrix formed by augmentation of 
said input signal. 

3. The method set forth in claim 1 wherein said step of 
generating said estimates comprises, for each of said 
estimates, the steps of: 

forming a matrix of vectors representing said input 
signal; 

augmenting said matrix to form a circulant matrix;, and 
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decomposing said circulant matrix by Fourier 
transformation to form a diagonal matrix, D. 



4. The method set forth in claim 3 wherein said step of 
generating said estimate further comprises the step of 
generating each of said estimates via the equation: 



S(m) = (1 - fcf)^ ^f~ P I> H (p)<?D(p) 
p=0 

= XfS (m - 1) + (1 - X f ) D H (m) GD (m) 



5. The method set forth in claim 3 wherein each of said 
estimates is generated via the equation: 

h (m) = h(m - l) + M^u^ 1 (m) JD H (m) e (m) . 



h (m) = h (m - 1) + (1 - l f ) S 1 (m) D H (m) e (m) 



where 



e(m) = y{m) - GD (m)h (m - l) , and 




where 
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e(m) = y{m) - GD(m)h(m - 1) , and 

S u (m) = X t S u (m - 1) + (1 - lf)D K (m) D (m) . 

6. A method for transmitting a signal over a channel in 
a multiple channel communication apparatus where said signal 
includes an input signal and multiple impulse responses 
wherein said multiple impulse responses are adaptively 
filtered, said method comprising the steps of: 

transmitting said signal over a channel, wherein said 
signal includes an input signal and at least first and second 
impulse responses; 

generating an estimate of an impulse response 
corresponding to each of said first and second impulse 
responses; 

generating a sum of said estimates; and 

generating an error signal representing the difference 

between said signal and sum of said estimates; 

wherein said estimate are generated using a frequency 

domain recursive least squares algorithm. 

7. The method set forth in claim 6 wherein each of said 
estimates is generated by diagonally decomposing by Fourier 
transformation a circulant matrix formed by augmentation of 
said input signal. 
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8. The method set forth in claim 1 wherein said step of 
generating said estimates comprises, for each of said 
estimates, the steps of: 

forming a matrix of vectors representing said input 
signal; 

augmenting said matrix to form a circulant matrix; and 
decomposing said circulant matrix by Fourier 
transformation to form a diagonal matrix, D. 

9. The method set forth in claim 8 wherein each of said 
estimates is generated via the equation: 



h (m) = h (m - 1) + (l - X t ) S 1 (m) D H (m) e (m) 



where 



e (m) = y(m) - GD (m)h (m - 1) , and 




p=0 

= XfS (m - 1) + (1 - X t ) D H (m) GD (m) 



10. The method set forth in claim 8 wherein each of said 
estimates is generated via the equation: 
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h (m) = h (m - l) + ^S^ 1 (m) D R (m) e(m) • 

where 

e(in) = J^m) - GD (m)h(m - 1) , and 

S u (m) = X f S u (m - l) + (l - X f )D H (m) D (m) . 

11. An apparatus for transmitting a signal over a 
channel in a multiple channel communication apparatus where 
said signal includes an input signal and multiple impulse 
responses, wherein said multiple impulse responses are to be 
adaptively filtered, said apparatus comprising: 

a transmitter for generating a data signal for 
transmission via a communication channel, wherein said signal 
includes an input signal and multiple impulse responses, 
wherein said multiple impulse responses are to be adaptively 
filtered; 

an adaptive filter circuit for generating an estimate of 
an impulse response corresponding to each of said first and 
second impulse responses; 

a subtracter circuit for generating an error signal 
representing the difference between said data signal and a sum 
of said estimates; 
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wherein said estimates are generated using a frequency 
domain recursive least squares algorithm. 



12. The apparatus set forth in claim 11 wherein said 
adaptive filter generates each of said estimates by diagonally 
decomposing by Fourier transformation a circulant matrix 
formed by augmentation of said input signal. 

13. The method set forth in claim 11 wherein said 
adaptive filter comprises: 

a circuit for forming with respect to each of said 
impulse responses a matrix of vectors representing said input 
signal/ 

a circuit for augmenting said matrix with respect to each 
of said impulse responses to form a circulant matrix; and 

a circuit for decomposing said circulant matrix with 
respect to each of said impulse responses by Fourier 
transformation to form a diagonal matrix, D* 

14. The apparatus set forth in claim 13 wherein each of 
said estimates is generated via the equation: 

h (m) = h (m - 1) + (1 - X f ) S~ 1 (m) D H (m) e (m) 

where 

e(m) = - C?D(m)h(m - 1) , and 
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S(m) = (1 - X f X* p D H (p)GD(p) 

p=0 

= X f S (m - 1) + (1 - A, f ) D K (m) GD (m) 



15. The apparatus set forth in claim 13 wherein each of 
said estimates is generated via the equation: 

h(m) = h(m - 1) + M-u^u 1 (*) D ^ W £ ( m ) * 



S n (m) = Xf S u (m - 1) + (l - X f )r> n (m) D (m) . 

16. The apparatus set forth in claim 15 wherein said 
adaptive filter circuit comprises a microprocessor. 



where 



e(m) = y\m) - GD(m)h(m ~ l) , and 



17. The apparatus set forth in claim 16 wherein said 
subtracter circuit comprises said microprocessor. 
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18, A method of multi-channel communication between at 
least first and second locations, said method comprising the 
steps of: 

transmitting multiple channels of information upstream 
from said first location to said second location; 

transmitting at least one additional channel of 
information downstream from said second location to said first 
location; 

generating estimates of impulse responses corresponding 
to distortion paths at said second location coupled between 
each of said multiple upstream channels and said downstream 
channel; and 

generating an error signal representing the difference 
between a desired signal on said downstream channel and a sum 
of said estimates and transmitting said error signal to said 
first location; 

wherein said estimate is generated using a frequency 
domain recursive least squares algorithm. 

19. The method set forth in claim 18 wherein said step 
of generating said estimates comprises generating, by 
diagonally decomposing by Fourier transformation, a circulant 
matrix formed by augmentation of a signal on each of said 
upstream channels. 
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20. The method set forth in claim 18 wherein said step 
of generating said estimates comprises, for each of said 
estimates, the steps of: 

forming a matrix of vectors representing a signal on said 

upstream channel; 

augmenting said matrix to form a circulant matrix; and 
decomposing said circulant matrix by Fourier 

transformation to form a diagonal matrix, D. 

21. The method set forth in claim 20 wherein each of 
said estimates is generated via the equation: 



h (m) = h (m - 1) + (1 - X f ) S x (m) D n (m) e (m) 



where 



e(m) = y{m) - GD(m)h(m - 1) , and 




p=0 

= %£ S (m - 1) + (1 - If) D H (m) GD (m) 



22. The method set forth in claim 20 wherein each of 
said estimates is generated via the equation: 
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h (m) = h(ra - 1) + Ru^u 1 ( m ) D ^ ( m ) ± ( m ) • 

where 

>\ 

e(m) = J[m) - GD(m)h(m - l) , and 

S u (m) = Ji f S u (m - 1) + (1 - X f )D H (m) D (m) . 



23. The method of claim 22 further comprising the step 

of: 

introducing a non-linear transformation into at least one 
of said multiple upstream channels. 

24. The method as set forth in claim 23 wherein each of 
said estimates comprises generating a model of a distortion 
path at said second location from said corresponding upstream 
channel to said downstream channel and wherein said step of 
generating said estimates further comprises the steps of: 

convolving each of said estimates with a signal on the 
corresponding one of said upstream channels to generate an 
estimate for each individual one of said upstream channels; 
and 

summing each of said individual estimates. 
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25. The method set forth in claim 20 wherein said 
multiple channels of upstream information comprise sound 
generated at said first location and said distortion paths 
comprise echo paths at said second location coupled between 
each of said multiple upstream channels and said downstream 
channel . 

26. A method of canceling distortion in a communication 
system having multiple transmission channels from a first 
location to a second location and at least one transmission 
channel from said second location to said first location, said 
method comprising the steps of: 

developing an estimated impulse response corresponding to 
each of said multiple upstream channels that models an 
interference path at said second location from said 
corresponding upstream channel to said downstream channel; 

convolving each of said estimated impulse responses with 
a signal on the corresponding one of said upstream channels to 
generate an estimate corresponding each of said upstream 
channels; and 

summing each of said individual estimates; 

wherein said estimate is generated using a frequency 
domain recursive least squares algorithm. 

27. The method set forth in claim 26 wherein said step 
of developing said estimated impulse responses comprises 
generating, by diagonally decomposing by Fourier 
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transformation, a circulant matrix formed by augmentation of a 
signal on each of said upstream channels. 



28. The method set forth in claim 26 wherein said step 
of developing said estimated impulse responses comprises, for 
each of said estimated impulse responses, the steps of: 

forming a matrix of vectors representing a signal on said 

upstream channel; 

augmenting said matrix to form a circulant matrix; and 
decomposing said circulant matrix by Fourier 

transformation to form a diagonal matrix, D. 

29. The method set forth in claim 28 wherein each of 
said estimates is generated via the equation: 



h(m) = h (m - 1) + (1 - X f ) S 1 (m) P H (m) e (m) 



where 



£(m) = y(m) - GD (m)h (m - 1) , and 



S(m) = (1 - X f >2^ ^~ p D H (p)GD(p) 

= X t S (m - 1) + (1 - Xf ) Z> H (m) GD (m) 
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30. The method set forth in claim 28 wherein each of 

said estimates is generated via the equation: 

h (m) = h (m - 1) + ^S^ 1 (m) X> H (m) e (m) • 

where 

e(m) = J(a) - GD(i)h(m - l) , and 

S n (m) = Xf S u (m - 1) + (1 - Xf )P H (m) D (m) . 

31. The method as set forth in claim 30 wherein said 
multiple upstream channels comprise a first channel and a 
second channel. 

32. The method set forth in claim 30 wherein each of 
said estimates is generated via the equation: 

h (m) = h (m - 1) + (1 - X t ) S" 1 (m) D H (m) e (m) 

where 

e ( m) = JTm) ~ GD (m)h(m - 1) , and 
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S(m) = (1 - ^f)^ *f D (P)AD(P) 

p = 0 

= X f S (m - 1) + (1 - X f ) D H (m) GD (m) 



33, A method of canceling acoustic echo in a 
communication system having multiple transmission channels 
from a first location to a second location and at least one 
transmission channel from said second location to said first 
location, said method comprising the steps of: 

developing an estimated impulse response corresponding to 
each of said multiple upstream channels that models an echo 
path at said second location from said corresponding upstream 
channel to said downstream channel; 

convolving each of said estimated impulse responses with 
a signal on the corresponding one of said upstream channels to 
generate an estimate corresponding each of said upstream 
channels; and 

summing each of said individual estimates; 

wherein said estimate is generated using a frequency 
domain recursive least squares algorithm. 



34. The method set forth in claim 33 wherein said step 
of developing said estimated impulse responses comprises 
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generating, by diagonally decomposing by Fourier 
transformation, a circulant matrix formed by augmentation of a 
signal on each of said upstream channels. 

35. The method set forth in claim 33 wherein said step 
of developing said estimated impulse responses comprises, for 
each of said estimated impulse responses, the steps of: 

forming a matrix of vectors representing a signal on said 

upstream channel; 

augmenting said matrix to form a circulant matrix; and 
decomposing said circulant matrix by Fourier 

transformation to form a diagonal matrix, D. 

36. The method set forth in claim 35 wherein each of 
said estimated impulse responses is generated via the 
equation: 



h (m) = h (m - 1) + (1 - X f ) S x (m) D (m) e (m) 



where 



e(m) = y[m) - GD (m)h (m - 1) , and 



m 




p=0 

= XfS (m - 1) + (1 - If) D K (m) GD (m) 
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37, The method set forth in claim 35 wherein each of 
said estimated impulse responses is generated via the 
equation: 

h (m) = h (m - 1) + HuS^ 1 (m) D H (m) e (m) . 



5 where 

e(m) = y(m) - GD(m)h(m - 1) , and 

H S u (m) = X f S u (m - 1) + (1 - If )D H (m) I> (m) • 

Cf 38. A mult i- channel teleconferencing apparatus 

comprising: 

113 at least first and second upstream electrical paths 

<a between a first location and a second location for 

transmitting acoustic signals from said first location to said 

second location; 

at least one downstream electrical path between said 
15 second location and said first location for transmitting 

acoustic signals from said second location to said first 

location; 

at least one non-linear transformation module coupled 
within each of one or more of said upstream paths; 
20 a finite impulse response filter coupled between said 

upstream paths and said downstream path for generating an 
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estimate of an impulse response corresponding to echo paths at 
said second location coupled between said at least first and 
second upstream channels and said downstream channel, said 
estimate being generated using a frequency domain recursive 
least squares algorithm; and 

a difference circuit for generating an error signal 
representing the difference between a signal on said 
downstream channel representing sound at said second location 
and said estimate, 

39. The apparatus set forth in claim 38 wherein said 
finite impulse response filter generates each of said 
estimates by diagonally decomposing by Fourier transformation 
a circulant matrix formed by augmentation of an input signal 
on a corresponding one of said upstream electrical paths* 

40. The method set forth in claim 38 wherein said finite 
impulse response filter comprises: 

a circuit for forming a matrix of vectors representing a 
an input signal on a corresponding one of said upstream 
electrical paths; 

a circuit for augmenting each of said matrices with 
respect to each of said impulse responses to form a circulant 
matrix; and 

a circuit for decomposing each of said circulant matrices 
with respect to each of said impulse responses by Fourier 
transformation to form a diagonal matrix, D. 
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41. The method set forth in claim 40 wherein each of 
said estimates is generated via the equation: 

h (m) = h (m - 1) + (1 - l f ) S _1 (m) D H (m) e (m) 



S(m) = (1 - ^f)^ *?~ P S H (p) GD(p) 
p=0 

= XfS (m - 1) + (1 - X t ) D H (m) GD (m) 



42. The method set forth in claim 40 wherein each of 
said estimates is generated via the equation: 

h (m) = h(m - 1) + HuS^ 1 (m) Z> H (m) e (m) . 



where 



= J^m) - GD(m)h(m - l) , and 



m 




where 



e ( m ) = y{ m ) - GD ( m )h ( m - 1) , and 



S u (m) = If S u (m - 1) + (1 - If )D a (m) D (m) . 
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43. The apparatus as set forth in claim 42 wherein said 

finite impulse response circuit comprises: 

a finite impulse response circuit corresponding to each 
of said upstream channels for generating an impulse response 
that models an impulse response corresponding to an- echo path 
at said second location from said corresponding upstream 
channel to said downstream channel, each finite impulse 
response filter coupled between said corresponding upstream 
path and said downstream path. 

44. The apparatus as set forth in claim 43 further 
comprising: 

a summing circuit for summing said estimates. 

45. The apparatus as set forth in claim 44 further 
comprising; 

at least first and second microphones at said first 
location for receiving sound, said microphones coupled to said 
first and second upstream electrical paths, respectively; 

at least first and second- speakers at said second 
location coupled to said first and second upstream electrical 
paths , respectively, for re-creating said sound from said 
first location at said second location; 

at least a third microphone at said second location for 
receiving sound, said microphone coupled to said downstream 
electrical path; 
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at least a third speaker at said first location coupled 
to said downstream electrical path for re-creating said sound 
from said second location at said first location* 

46. An apparatus for performing echo cancellation in a 
multi-channel teleconferencing system comprising at least 
first and second upstream electrical paths between a first 
location and a second location for transmitting acoustic 
signals from said first location to said second location and 
at least one downstream electrical path between said second 
location and said first location for transmitting acoustic 
signals from said second location to said first location, said 
apparatus comprising; 

at least one non-linear transformation module for 
coupling within each of one or more of said upstream paths; 

a finite impulse response filter for coupling between 
said upstream paths and said downstream path for generating an 
estimate of an impulse response corresponding to echo paths at 
said second location coupled between each of said multiple 
upstream channels and said downstream channel in which said 
estimate is generated using a frequency domain recursive least 
squares algorithm. 

47. The apparatus set forth in claim 46 wherein said 
finite impulse response filter generates each of said 
estimates by diagonally decomposing by Fourier transformation 
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a circulant matrix formed by augmentation of an input signal 
on a corresponding one of said upstream electrical paths* 



48. The method set forth in claim 46 wherein said finite 
impulse response filter comprises: 

a circuit for forming a matrix of vectors representing a 
an input signal on a corresponding one of said upstream 
electrical paths; 

a circuit for augmenting each of said matrices with 
respect to each of said impulse responses to form a circulant 
matrix; and 

a circuit for decomposing each of said circulant matrices 
with respect to each of said impulse responses by Fourier 
transformation to form a. diagonal matrix, D* 

49. The method set forth in claim 4 8 wherein each of 
said estimates is generated via the equation; 

h (m) = h (m - 1) + (l - X t ) S~ X (m) D H (m) e (m) 

where ■ 



e(m) = y[n) - GD(m)h(m - 1) , and 
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Sfm) = (1 - Xf)^ lf' P D E (p)GD(p) 

= X f S (m - 1) + (1 - X t ) D H (m) GD (m) 



50. The method set forth in claim 48 wherein each of 
said estimates is generated via the equation: 

h (m) = h (m - 1) + [iySu 1 (m) D K (m) e(m) . 



S u (m) = X t S u (m - 1) + (1 - X t )D n (m) D (m) ♦ 

51. The apparatus as set forth in claim 50 further 
comprising: 

a difference circuit coupled to an output of said finite 
impulse response circuit for coupling to said downstream path 
for generating an error signal representing the difference 
between a signal on said downstream path representing sound at 
said second location and said estimate. 



where 



e(m) = y(m) - GD(m)h(m - 1) , and 
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52. The apparatus as set forth in claim 51 wherein said 
finite impulse response circuit comprises: 

multiple finite impulse response circuits for each of 
said upstream channels for generating an impulse response that 
models an impulse response corresponding to an echo path at 
said second location from said corresponding upstream channel 
to said downstream channel. 
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MULT I -CHANNEL FREQUENCY-DOMAIN ADAPTIVE 
FILTER METHOD AND APPARATUS 



Abstract of the Invention 

The invention is a method and apparatus for frequency- 
domain adaptive filtering that has broad applications such as 
to equalizers, but is particularly suitable for use in 
acoustic echo cancellation circuits for stereophonic and other 
multiple channel teleconferencing systems. The method and 
apparatus utilizes a frequency-domain recursive least squares 
criterion that minimizes the error signal in the frequency- 
domain ♦ In order to reduce the complexity of the algorithm, a 
constraint is removed resulting in an unconstrained frequency- 
domain recursive least mean squares method and apparatus, A 
method and apparatus for selecting an optimal adaptation step 
for the UFLMS is disclosed. The method and apparatus is 
generalized to the multiple channel case and exploits the 
cross-power spectra among all of the channels. 
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IN THE UNITED STATES 
PATENT AND TRADEMARK OFFICE 

Declaration and Power of Attorney 
As the below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my 

name. 

I believe I am the original, first and sole inventor of the subject matter which is 
claimed and for which a patent is sought on the invention entitled MULTI-CHANNEL 
FREQUENCY-DOMAIN ADAPTIVE FILTER METHOD AND APPARATUS the 
specification of which is attached hereto. 

I hereby state that I have reviewed and understand the contents of the above 
identified specification, including the claims, as amended by an amendment, if any, 
specifically referred to in this oath or declaration. 

I acknowledge the duty to disclose all information known to me which is material 
to patentability as defined in Title 37, Code of Federal Regulations, 1.56. 

I hereby claim foreign priority benefits under Title 35, United States Code, 119 of 
any foreign application(s) for patent or inventor's certificate listed below and have also 
identified below any foreign application for patent or inventor's certificate having a filing 
date before that of the application on which priority is claimed: 

None 

I hereby claim the benefit under Title 35, United States Code, 120 of any United 
States application(s) listed below and, insofar as the subject matter of each of the claims 
of this application is not disclosed in the prior United States application in the manner 
provided by the first paragraph of Title 35, United States Code, 112, 1 acknowledge the 
duty to disclose all information known to me to be material to patentability as defined in 
Title 37, Code of Federal Regulations, 1.56 which became available between the filing date 
of the prior application and the national or PCT international filing date of this application: 

None 

I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true; and further that 
these statements were made with the knowledge that willful false statements and the like 
so made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 
of the United States Code and that such willful false statements may jeopardize the validity 
of the application or any patent issued thereon. 
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I hereby appoint the following attorney(s) with full power of substitution and 
revocation, to prosecute said application, to make alterations and amendments therein, to 
receive the patent, and to transact all business in the Patent and Trademark Office 
connected therewith: 



Lester H. Birnbaum (Reg. No. 25,830) 

Richard J. Botos (Reg. No. 32,016) 

Jeffery J. Brosemer (Reg. No. 36,096) 

Kenneth M. Brown (Reg. No. 37,590) 

Craig J. Cox (Reg. No. 39,643) 

Donald P. Dinella (Reg. No. 39,961) 

Guy Eriksen (Reg. No. 41,736) 

Martin I. Finston (Reg. No. 31,613) 

James H. Fox (Reg. No. 29,379) 

William S. Francos (Reg. No. 38,456) 

Barry H. Freedman (Reg. No. 26,166) 

Julio A. Garceran (Reg. No. 37,138) 

Mony R. Ghose (Reg. No. 38,159) 

Jimmy Goo (Reg. No. 36,528) 

Anthony Grillo (Reg. No. 36,535) 

Stephen M. Gurey (Reg. No. 27,336) 

John M. Harman (Reg. No. 38,173) 

Michael B. Johannesen (Reg. No. 35,557) 

Mark A. Kurisko (Reg. No. 38,944) 

Irena Lager (Reg. No. 39,260) 
Christopher N. Malvone (Reg. No. 34,866) 

Scott W. McLellan (Reg. No. 30,776) 

Martin G. Meder (Reg. No. 34,674) 

John C. Moran (Reg. No. 30,782) 

Michael A. Morra (Reg. No. 28,975) 

Gregory J. Murgia (Reg. No. 41,209) 

Claude R. Narcisse (Reg. No. 38,979) 

Joseph J. Opalach (Reg. No. 36,229) 

Neil R. Ormos (Reg. No. 35,309) 

Eugen E. Pacher (Reg. No. 29,964) 

Jack R. Penrod (Reg. No. 31,864) 

Daniel J. Piotrowski (Reg. No. 42,079) 

Gregory C. Ranieri (Reg. No. 29,695) 

Scott J. Rittman (Reg. No. 39,010) 

Eugene J. Rosenthal (Reg. No. 36,658) 

Bruce S. Schneider (Reg. No. 27,949) 

Ronald D. Slusky (Reg. No. 26,585) 

David L. Smith (Reg. No. 30,592) 

Patricia A. Verlangieri (Reg. No. 42,201)- 

John P. Veschi (Reg. No. 39,058) 

David Volejnicek (Reg. No. 29,355) 

Charles L. Warren (Reg. No. 27,407) 

Jeffrey M. Weinick (Reg. No. 36,304) 

Eli Weiss (Reg. No. 17,765) 
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I hereby appoint the attorney(s) on ATTACHMENT A as associate attorney(s) in 
the aforementioned application, with full power solely to prosecute said application, to 
make alterations and amendments therein, to receive the patent, and to transact all business 
in the Patent and Trademark Office connected with the prosecution of said application. No 
other powers are granted to such associate attorney(s) and such associate attorney(s) are 
specifically denied any power of substitution or revocation. 
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